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RECOMBINANT COFFEE BEAN tt-GALACTOSIDASE 

Technical Fielcj 

The invention is directed to recombinant 
production of coffee bean a-galactosidase for use in 
5 modifying human red blood cells. More specifically, 
the invention concerns a recombinant enzyme useful in 
the conversion of Type B red blood cells to cells 
having the physiological effect of Type O. 

Background Art 

10 The desirability of obtaining "universal donor" - 

type red blood cells for transfusion is well 
recognized. Red blood cells of the n O n type i.e., 
lacking both the "A" and "B" antigenic determinants 
are suitable. Treatment of the Type A antigenic 

15 determinant with N-acetylgalactoseaminidase to destroy 
its antigenicity has been disclosed in U.S. patent 
4,609,627. Of more relevance to the present invention, 
the use of coffee bean a?-galactosidase for the* 
conversion of Type B antigen to the antigenic 

20 equivalent of Type O has also been described by 

Goldstein in U.S. patents 4,330,619 and 4,427,777. 
Also describing this work are papers by Lenny, L.L., et 
al;, Blood (1991) 77:1383-1388, and by Goldstein, J., 
et al., Science (1982) 215:168-170. 

25 In the work by the Goldstein group, it was found 

necessary to equilibrate the erythrocytes to a pH of 
about 5.7-5.8 before contacting the erythrocytes with 
the or-galactosidase. This equilibration was found to 
prevent the hemolysis noted in attempts to remove the 

3 0 terminal galactose residue from the B determinant using 
conditions similar to those reported by Zarnitz, M.L., 
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e t al., J Am Chem Soc (I960) 82:3953-3957, and by 
Harpaz, N. , et al . , Arch Biochem Biophvs (1975) 
170:676-683. These earlier conversions had been 
conducted at a pH of about 5 . 
5 The coffee bean ar-galactosidase utilized in the 

foregoing studies in U.S. patents 4,330,619 and 
4,42 7,777 is apparently equivalent in purity to the 
commercially available a-galactosidase (EC 3.2.1.22) 
marketed by Boehringer Mannheim. Further purification 
10 by undisclosed methods may have been conducted by 
Goldstein (1982) supra and Lenny (1991) supra. The 
molecular weight of coffee bean a-galactosidase was 
reported to be 26 kD by Barham, D., et al . , Phvtochem 
(1971) 10:1759-1763. The a-galactosidase can be 
15 purified using affinity chromatography with a substrate 
or substrate analog, as described by Courtois, J.E., et 
al., Meth Bnzymol (1966) 8:565-571, and by Harpaz, N. , 
et al., Biochem Biophys Arha (1974) 34JL: 213-221 . As 
shown hereinbelow, the commercial preparation of coffee 
20 bean or-galactosidase available is, in fact, an impure 
mixture of at least four proteins; only a vanishingly 
small amount of this preparation has the molecular 
weight 26 kD. In general, the readily detectable 
proteins in the mixture have molecular weights of 68 kD 
25 (40%), 40 kD (35%) and 36 kD (10%); the 68 kD protein 
is BSA added as a stabilizer. The 4 0 kD protein has 
the Qf-galactosidase activity and corresponds in 
molecular weight to the deduced amino acid sequence as 
described below. 
3 0 The work of Goldstein et al. cited above indicates 

that coffee bean or-galactosidase is the enzyme of 
choice for the conversion of B antigen on erythrocytes 
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to that consistent with universal donor erythrocytes . 
Although it is thus clear that coffee bean 
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ar-galactosidase is desirable for this conversion, 
practical sources for pure preparations of this enzyme 
for such use have not been available. 

The genes or cDNAs encoding a-galactosidases from 
5 other sources have been retrieved and reported. These 
enzymes are not as effective for the degalactosylation 
of B antigen. Most closely related to the 
a-galactosidase of the invention is the enzyme from the 
legume guar (Cyamopsis tetragonal oba) seed. (See 
10 Hughes, S.G., et al . , Plant Mol Biol (1988) 11:783-789? 
Overbeeke, N., et al., Plant Mol Biol (1989) 13:541- 
550; and PCT Application WO87/07641. ) The cDNA 
encoding the guar a-galactosidase has been expressed 
and the protein secreted from B. subtilis, as reported 
15 by Overbeeke, N., Applied Environme nt Mic:roH-ip 1 (1990) 
1429-1434. This enzyme, however, has the disadvantage 
of a considerably lower pH optimum than that of coffee 
bean ar-galactosidase. Retrieval of c*-galactosidase- 
encoding cDNA from coffee bean by methods analogous to 
20 those used for the guar enzyme cDNA is not possible, 
since in the guar seed a-galactosidase is synthesized 
in the aleurone layer during germination and an 
aleurone layer is lacking in coffee beans. 
Accordingly, the procedures followed in obtaining guar 
25 messenger RNA encoding a-galactosidase cannot be 
followed in coffee bean. 

Other sources of a-galactosidase also are known, 
and the genes or cDNAs have been cloned in some cases . 
The gene encoding the a-galactosidase of yeast has been 
30 reported by Liljestrom, P.L., Nucleic Acids Res (1985) 
i3 .-7257-7269; Sumner-Smith, M. , et al . , Gene (1985) 
36:333-340. T h e DNA encoding the enzyme from E. coli 
is described by Liljestrom, P.L., et al . , Nucleic Acidfl 
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Res (1987) JL5: 2213 -2220 - Human a-galactosidase A has 
been implicated in Fabry disease , and cDNA encoding 
this protein has been isolated (Calhoun, D.H. , et al . , 
Proc Natl Acad Sci USA (1985) 83.: 7364 -7368 ; Biship, 
5 D.F., et al., Proc Natl Acad Sci USA (1986) 83:4849- 
4853) . A genomic clone containing the promoter for 
this gene was reported by Quinn, M., et al., Gene 
(1987) 58:177-188, and the recombinant enzyme has been 
suggested as a treatment for Fabry disease, as set 
10 forth in PCT Application W09 0/113 53 and U.S. patent 
5,179,023. 

None of these genes appears suitable for the 
production of a-galactosidase for use in the conversion 
of B erythrocytes, since this range of enzymes has 

15 considerable variance in substrate specificity and 
differences in pH optima. For example, Overbeeke et 
al. (1987 PCT Application WO87/07641) demonstrate that 
a-Gals of plant origin (guar, fenugreek, lucerne, and 
coffee beans) are able to reduce the galactose content 

20 of galactpmannans, whereas microbial (Aspergillus 

niger, Saccharomyces carlsbergensis , and Escherichia 
coli) a-Gals lack this enzymatic capability. Although 
it has been shown that a-galactosidase from Clostridium 
sporogenes (Dybus, S., Transfusion (1983) 23:244-247) 

25 and from soybeans (Harpaz, N. , et al., Eur J Biochem 
(1977) 77:419-426) can remove the terminal galactose 
from B-antigen, these enzymes are more complex and the 
pH optima appear to be too low to effect the desired 
conversion on erythrocytes, since at low pH values, 

30 hemolysis occurs. 

Thus, the present invention provides, for the 
first time, a practical source for purified and 
isolated coffee bean a-galactosidase required for the 
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conversion of B-type red blood cells to red blood cells 
capable of serving as universal donors. 

Disclosure of t he Invention 

The invention is directed to the recombinant 
5 production of coffee bean a-galactosidase (coffee bean 
a-Gal) . The production of this protein in recombinant 
form permits the practical production of sufficient 
quantities of pure enzyme for use in effecting the 
production of universal donor red blood cells. 

10 Thus, in one aspect, the invention is directed to 

DNA encoding coffee bean a-galactosidase having the 
amino acid sequence shown in Figure 1 herein or 
a-galactosidase encoded by the allelic variants of its 
encoding DNA. The invention is also directed to 

15 expression systems for the production of this protein 
and to methods for its production using these systems . 

In other aspects, the invention is directed to 
methods to manufacture universal donor red blood cells 
by treating red blood cells containing Type B antigen 

20 with the recombinant enzyme of the invention, to 

antisense and triple helix- forming oligomers, and to 
antibodies specific for coffee bean a-Gal. 

Brief Description of the Drawings 

Figure 1 shows the DNA encoding coffee bean 
25 a-galactosidase and the deduced amino acid sequence of 
the enzyme. The DNA includes a region encoding a 
preprosequence numbered (-57) -(-l), as well as the 
sequence encoding the mature protein, designated 
positions (+l)-(363). 
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Figure 2 shows the results of melibiose-af f inity 
chromatography purification of coffee bean 
o?-galactosidase as monitored by SDS-PAGE. 

Figure 3 shows a series of Western blots 
5 monitoring the expression of the a-galactosidase gene 
in developing coffee beans. 

Figure 4 shows a diagram of the sequencing 
strategy for of-galactosidase cDNA. 

Modes of Carrying Out the Invention 

10 Coffee bean or-galactosidase (coffee bean a-Gal) is 

provided in recombinant form permitting sufficient 
quantities of purified enzyme to be produced for its 
practical use in treatment of red blood cells 
containing the B antigen. By "coffee bean 

15 <x-galactosidaseV is meant a protein having thet ability 
to remove the terminal a-galactose from the B antigen, 
which has the amino acid sequence shown for the mature 
protein (numbered 1-363) in Figure 1 herein. Also 
included in the definition are proteins having this 

20 activity which are encoded by DNAs that are allelic 
variants of the DNA shown in Figure 1 as encoding the 
mature protein. As allelic variants are very similar 
in sequence to each other, the availability of the DNA 
of Figure 1 provides probes which can readily retrieve 

25 such variants. 

It is also recognized that minor variations in 
amino acid sequence, such as one or two deletions, 
substitutions, additions, or other modifications can 
generally be made without affecting the activity of a 

30 protein. Accordingly, "coffee bean a-galactosidase" as 
defined herein, includes such variants, as well as 
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fragments of the 3 63 -amino acid sequence shown which 
retain activity. 

Described in the examples below is the retrieval 
of the cDNA encoding coffee bean a-galactosidase . As 
5 it occurs in the bean, the mature enzyme is produced as 
a preproenzyme with an upstream sequence of 57 amino 
acids. Analogies with other preproenzymes suggest a 
signal protease cleavage site between residues (-20) 
and (-19); thus, the secretion signal would constitute 
10 residues (-57) -(-20) to yield a proenzyme of the 

sequence shown at (-19) to 363. The mature protein 
contains 363 amino acids, as verified by the N-terminal 
sequence of the isolated mature protein described in 
the examples below. 
15 In addition to providing recombinant materials for 

the production of the enzyme, the invention makes 
available the native nucleic acid sequences encoding 
coffee bean o?-Gal. The sequences provide the 
information required to design oligonucleotides which 
20 regulate the production of this enzyme. The native 

sequence can be used as a basis for such design per se 
or can be used as a probe to retrieve additional 
portions of the DNA region which effects coffee bean 
o?-Gal production. These additional portions can also 
25 be used for such design. These oligomers may be 

"antisense" oligomers which are complementary to the 
single strand encoding these proteins or to the related 
regulatory sequences included in the RNA and/or DNA, or 
may be capable of forming triple helices with the 
3 0 duplex gene either in the coding region or in the 
regulatory regions thereof. 

The availability of the purified enzyme also ' 
permits the preparation of antibody compositions which 
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consist of antibodies immunoreactive with the enzyme. 
In addition to providing suitable immunogens, the 
purified enzyme can be used as a reagent to isolate 
compositions wherein all contained antibodies are 
5 immunoreactive with the enzyme. 

General Description and De finitions 

The practice of the present invention will employ, 
unless otherwise indicated, conventional biochemistry, 
immunology, molecular biology and recombinant DNA 

10 techniques within the skill of the art. Such 

techniques are explained fully in the literature. See, 
e.g., Maniatis et al ., Molecular Cloning: A Laboratory 
Manual (1982); DNA Cloning: A Practical Approach, vol. 
I & II (D, Glover, ed. ) ; Oligonucleotide Synthesis (N. 

15 Gait, ed., 1984); Nucleic Acid Hybridization (B. Hames 
& S. Higgins, eds., 1985); Transcription and 
Translation (B. Hames & S. Higgins, eds., 1984); Animal 
Cell Culture (R. Freshney, ed., 1986); Perbal, A 
Practical . Guide to Molecular Cloning (1984) . 

20 The following terminology will be used in 

accordance with the definitions set out below in 
describing the present invention. 

DNA "control sequences" refers collectively to 
promoter sequences, ribosome binding sites, 

25 polyadenylation signals, transcription termination 

sequences, upstream regulatory domains, enhancers, and 
the like, which collectively provide for the 
transcription and translation of a coding sequence in a 
host cell. 

3 0 A coding sequence is "operably linked to" control 

sequences when expression of said coding sequences is 
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effected when the expression system is contained in an 
appropriate host cell. 

A "host cell" is a cell which has been modified to 
contain, or is capable of modification to contain, an 
5 exogenous DNA sequence. 

Antibodies which are "specifically immunoreactive" 
with a referent antigen refers to antibodies which are 
capable of binding to such antigen with a perceptibly 
greater affinity as compared to the ability to bind 

10 other antigens. The level of difference in affinity 
required depends on the nature of the application. 
Preferably, the referred-to antigen is bound with an 
affinity at least ten time greater than that 
descriptive of binding to a contaminant and more 

15 preferably with 100 times more affinity. 

When the enzyme of the invention is described as 
being in "purified and isolated" form, the enzyme is 
described in a state wherein the composition in which 
it resides is at least 90% by weight composed of the 

20 enzyme, preferably 95% by weight and more preferably 

99% by weight with respect to the organic components of 
the preparation. 

Recombinant Production 

The availability of the gene encoding coffee bean- 

25 or-galactosidase permits the efficient production of . the 
recombinant material. The desired coding sequence 
that of the effective fragment, mature protein, 
proenzyme, or preproenzyme , or fragments thereof --is 
ligated into suitable expression systems for 

30 recombinant production. A wide variety of such 

expression systems and corresponding host cells is by 
now known in the art. The desired active protein may 
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be produced intracellularly or secreted, depending on 
whether or not suitable secretory leader sequences are 
included in the expression system. Further, the active 
protein may be produced in the form of a fusion protein 
5 which may be active per se or which may be cleaved to 
yield the desired enzyme. All of these variations are 
by now standard in the art. Suitable expression 
systems include both bacterial systems and eucaryotic 
systems, including those of yeast, mammalian cells, 
10 insect cells and plants. 

In more detail, the coffee bean a-Gal of the 
invention can be produced by constructing an expression 
system and modifying a host cell to contain this system 
to provide a cell line or culture capable of expressing 
15 DNA encoding the enzyme. DNA encoding the enzyme or 
active fragments can either be prepared directly by 
synthetic methods based on the native sequence (or 
equivalent sequences encoding the same amino acids) , or 
by using the native sequence to design oligonucleotide 
20 probes to. retrieve the coding sequence using known 

techniques. See, e.g., Mayfield et al, J Virol (1983) 
4.:259-264. The coding sequence can be comprised 
entirely of the mature protein-encoding sequences, or 
such sequences can be fused to other sequences (e.g., 
25 leader sequences) so that a. fusion protein is encoded. 
See, e.g., U.S. Pat. Nos. 4,431,739; 4,425,437; 
4,338,397. Synthetic coding sequences will also allow 
for the convenient construction of coding sequences 
which express coffee bean ex-Gal modified as described 
30 above. Alternatively, coding sequences for these 
modified forms can be prepared by site-directed 
mutagenesis of native nucleotide sequences. The 
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techniques of site-directed mutagenesis are known in 
the art. 

To complete construction of an expression system, 
the coding sequence as described above for the coffee 
5 bean or-Gal is then operably linked to control sequences 
(e.g., a promoter, etc.), so that the DNA sequence 
encoding the enzyme is transcribed into messenger RNA 
in the host cell modified to contain the expression 
system. 

10 For expression in a procaryotic, yeast or 

mammalian cell, the promoter and other control 
sequences are usually heterologous to the enzyme- 
encoding sequence . The expression system may be 
constructed as a discrete molecular entity flanked by 

15 convenient restriction sites; alternatively, it may be 
constructed by inserting the coding sequence into a 
previously constructed expression vector with an 
appropriate insertion site. 

A number of procaryotic expression vectors are 

20 known in the art. See, e.g., U.S. pat. Nos. 4,440,859; 
4,436,815; 4,431,740; 4,431,739; 4,428,941; 4,425,437; 
4,418,149; 4,411,994; 4,366,246; 4,342,832; see also 
U.K. Publication Nos. GB2,121,054; GB2,008,123; 
GB2,007,675; and European Publication No. 103,395. 

25 Preferred procaryotic expression vectors are those for 
E. coli . Yeast expression vectors are also known in 
the art. See, e.g., U.S. Pat. Nos. 4,446,235; 
4,443,539; 4,430,428; see also European Publication 
Nos. 103,409; 100,561; 96,491. 

30 Expression may also be achieved in mammalian 

cells. Appropriate mammalian expression hosts include 
kidney cell lines (e.g., CV-1 monkey kidney cell 
lines) , fibroblast cell lines, Chinese hamster ovary 
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(CHO) cells, HeLa cells, mouse NIH/3T3 and/or LMTK 31 
cells. Alternatively, the coffee bean a-Gal may be 
expressed in myeloma cell lines employing 
immunoglobulin promoters- See, e.g., Banerjle et al. 
5 Cell (1983) 33:729-740; U.S. Pat. No. 4,663,281. 

Mammalian expression vectors employing viral promoters 
(e.g., SV40 early region promoter, Rous sarcoma virus, 
LTR promoter, etc.) are also well known in the art. 
See, e.g., Pachl et al., J Virol (1987) 61:315-^325; 

10 Gorman et al., Proc Natl Acad Sci USA (1982) 79:6777- 
6781; Southern et al., J Mol App Genet (1982) 1:327- 
341; PCT Publication No. W087/02062. Preferred 
eucaryotic expression vectors employ the vaccinia 
virus, the SV40 virus, or the Rous sarcoma virus. See, 

15 e.g., Mackett et al., J Virol (1984) 49:857; DNA 

Cloning, vol. II, pp. 191-211, supra; PCT Publication 
No. WO86/07593; Chakrabarty et al . , Mol Cell Biol 
(1985) 5:403. 

A host cell that has been stably transformed by an 
20 expressipn system for the enzyme is then selected to 
produce the recombinant coffee bean a-Gal. 

Thus, the expression systems are constructed using 
standard recombinant techniques of restriction enzyme 
cleavage, modification, if necessary, effected by site- 
25 directed mutagenesis, and ligation. The expression 

systems are amplified using cloning or the polymerase 
chain reaction (PCR) and included in vectors suitable 
for modifying host cells so as to contain the 
expression systems of the invention. Once such 
3 0 modified hosts are obtained, the cells are cultured 

under conditions appropriate for the choice of host and 
under conditions wherein the control sequences 
contained in the expression systems effect the 
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expression of the encoding DNA. The recombinant coffee 
bean a-galactosidase produced is then recovered from 
the culture and purified using standard techniques such 
as affinity chromatography, ion-exchange 
5 chromatography/ reverse-phase chromatography, and the 
like. 

Use and Industrial Applicability 

The purified enzyme is then available for use in 
removal of the terminal a-galactose from B antigen. 

10 Application of the enzyme for this purpose is described 
according to the methods of Goldstein et al . , supra. In 
general, the method involves preequilibrat ion of the 
red blood cells to a pH of 5.7-5.8 followed by 
treatment with the enzyme for suitable time periods and 

15 recovery of the red blood cells, restoring them to 
physiological pH. 

Red blood cells suitable for use in the method of 
the invention include those bearing B antigens, i.e. 
Type B as. well as Type AB red blood cells. In the case 

20 of AB antigen-characterized red blood cells, additional 
treatment with N-acetylgalactosaminidase is required to 
remove the A antigen in order to obtain a universal 
donor cell . 

In addition to the use of the coffee bean a-Gal 
25 enzyme of the invention to produce universal donor 
cells as described above, the recombinant enzyme 
provides an adequate supply for other uses . For 
example, U.S. patent 5,179, 023 discloses the use of 
a-galactosidase in the treatment of Fabry's Disease. 
3 0 of-Galactosidase is also used to reduce the content of 
galactomannans in various foodstuffs. See, for 
example, Japanese applications JP 61/274695; JP 
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12/47050; JP 12/47061. a-Galactose has also been used 

for the treatment of guar gum — i.e. galactomannan — 

to reduce the galactose content and alter physical 

properties of foodstuffs as disclosed in PCT 
5 application WO87/07641. 

The use of a-galactosidase to hydrolyze raffinose 

has been disclosed with respect to sugar beet extract. 

Porter, et al. (1991) _; Overbeeke, m 

et al. (1989) _. It has also been used to make 

10 raffinose available for more complete utilization of 

molasses as a fermentation feedstock as disclosed in 

European application 241044. 

The use of a-galactosidase to treat foods or as a 

dietary supplement to destroy components that cause 
15 flatulence has also been described by Porter, et al. 

(1990) supra and Overbeeke, et a J. (1989) supra and in 

PCT application WO90/14101. 

Thus, the coffee bean a-Gal of the invention has a 

multiplicity of uses. All of them benefit from the 
20 availability of recombinant forms of coffee bean a-Gal. 

Regulation of Expression 

It may be desirable to regulate the expression of 
coffee bean a-Gal in its native environment or in the 
recombinant host. The invention provides the native 

25 DNA sequence encoding mature or preproenzyme of coffee 
bean a-Gal which makes possible the design of suitable 
antisense and triple helix- forming oligonucleotides 
that can interrupt the expression of the gene . 

Antisense oligonucleotides are generally designed 

30 as complements to the messenger RNA encoding the 

desired protein. The complement binds through Watson- 
Crick base-pairing to the mRNA interfering with 
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translation either by enhancing mRNA degradation by 
RNAse H, by preventing or inhibiting processing to 
mature RNA, or by preventing translation. The 
oligonucleotide may bind either to the translated 
5 region or to control sequences in the mRNA. 

Similarly, as the transcription of DNA involves 
partial disassembly of the double helix, antisense 
oligonucleotides may also bind to transcribed or 
nontranscribed regions of the DNA to inhibit 

10 transcription. Absolute homology between the target 
and the antisense sequences is preferred but not 
required for the inhibition. Holt, J.T. et al . , Proc 
Natl Acad Sci (1986) 8^:4794. 

Oligonucleotides may also be designed to form a 

15 triplex DNA structure with the intact duplex gene 
according to certain binding rules. Moffat, A.S., 
Science (1991) 252 : 1374-1375. When this triplex 
structure is formed in the promoter region of a gene, 
it has been shown to disrupt transcription of that 

20 gene. Orson, F.M. et al., Nuc Acids Res (1991) 

JL9: 3435-3441 . Again, the oligomer designed to form a 
triplex can be designed to bind the duplex gene in 
either regulatory or transcribed regions or both. 

The invention, therefore, also includes methods 

25 and compositions useful to regulate the production of 
the coffee bean enzyme by use of the antisense or 
triple helix-forming techniques. The relevant 
oligomers may be delivered to cells containing the 
expression systems or to the native developing coffee 

30 bean. 



r 
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Preparation of Antibodies 

The production of polyclonal and monoclonal 
antibodies reactive with the purified enzyme is within 
the skill of the art. A mammal, such as a mouse, is 
5 immunized with the enzyme, or a fragment or a precursor 
containing the relevant epitopes. The serum may be 
harvested as a polyclonal composition. The 
availability of purified coffee bean a-Gal makes 
possible the production of compositions of antibodies 

10 substantially all of which bind this enzyme or its 
precursors. Standard immunoaf f inity chromatography 
techniques can be employed for this purpose. The 
resulting compositions are useful in quality control 
assays for preparation of the enzyme as well as in 

15 assay methods for levels of coffee bean a-gal as an 
index to seed development . 

Antibody-producing B lymphocytes can also be 
recovered from the animal and immortalized by, for 
example, fusion with a continuous cell line to produce 

20 an immortal antibody- producing cell line; i.e., a 

hybridoma, trioma, etc. Immortal, antibody-producing 
cell lines can also be created by techniques other than 
fusion, such as direct transformation of B-lymphocytes 
with oncogenic DNA, or transfection with Epstein-Barr 

25 virus. See, e.g., M. Schreier et al . , Hybridoma 
Techniques (1980); Hammerling et al., Monoclonal 
antibodies and T-Cell Hybridomas (1981); Kennett et 
al., Monoclonal Antibodies (1980); see also U.S. Pat. 
Nos. 4,341,761; 4,399,121; 4 , 427 , 783 ; 4 , 444 , 887 ; 

30 .4,451,570, 4,466,917; 4,472,500; 4,491,632; 4,493,890. 
After screening, monoclonal antibody is harvested from 
the immortalized cell lines utilizing conventional 
separation and purification techniques. 
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The antibodies thus prepared are useful in 
purification procedures for isolation of highly- 
purified coffee bean Qf-Gal and for assessing levels of 
this enzyme in situ. 
5 The following examples are intended to illustrate , 

but not to limit , the invention. 

Example 1 

Purification and N-Terminal Sequencing 
of Coffee Bean q-Galactosidase 

10 Coffee bean a-galactosidase manufactured by 

Boehringer Mannheim Biochemicals (BMB a-Gal) was 
initially analyzed by SDS-PAGE and found to contain 
about 40% of a 68 kD protein (BSA added as stabilizer) ; 
35% of a 40 kD protein; 10% of a 36 kD protein; and 15% 

15 of several minor proteins including an approximately 26 
kD protein (corresponding to the molecular weight for 
a-galactosidase from coffee bean previously reported) . 

For affinity chromatography, the commercial 
preparation containing 500 fig total protein in 500 /il 

20 was buffer- exchanged by centrifugal ultrafiltration 
(Centricon 30, Amicon) into 0.1X assay buffer. (IX 
buffer is 123 mM Na 2 HP0 4/ 37 mM citric acid, pH 6.0.) 
The sample was loaded onto a 1 ml melibiose-agarose 
column (Sigma) equilibrated with 0.1X assay buffer at 

25 4°C, and one 0.5 ml fraction was collected. The a-Gal 
was eluted with 5 volumes of IX assay buffer containing 
15 mM p-nitrophenyl-a-D-galactoside (PNPG) , and ten 0.5 
ml fractions were collected. Fractions were analyzed 
for enzymatic activity and by SDS-PAGE. To assess 

30 enzymatic activity, 12.5 /xl of each fraction was 

combined with 150 fil of IX assay buffer containing 10 
mM PNPG in a microtiter plate well, incubated for 1.5 
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hours at room temperature, and the optical density at 
410 nm was determined in a microtiter plate reader. 

The resulting elution pattern is shown in Figure 
2, as analyzed by SDS-PAGE of the fractions. Lane 1 
5 represents molecular weight markers; lane 2, flow- 
through after binding; lanes 3-12, fractions collected 
during washing; lanes 13-17, fractions collected during 
elution; and lane 18, a sample of the BMB a-Gal 
starting material before application to the column. 

10 The fraction shown in lane 14, which contains 

essentially pure 40 kD protein, was determined to have 
the highest level of a-galactosidase activity. 

A portion of the 40 kD protein purified in this 
way was subjected to N-terminal sequence determination 

15 using an Applied Biosystems 4778-120A Protein 
Sequencer, and the first 12 amino acids were 
determined. This sequence showed a high degree of 
homology to the corresponding guar protein and less 
homology to the human and yeast counterparts . The 

20 sequence obtained is identical to the first 12 amino 
acids shown as positions 1-12 in Figure 1. 

Example 2 

Preparation of Anti-gy-Galactosidase Antibodies 
Commercial coffee bean a-Gal (BMB) (3 00 /xg) was 

25 buffer- exchanged (Centricon 30, Amicon) into phosphate 
buffered saline (PBS: 137 mM NaCl, 2.7 mM KCl, 4 . 3 mM 
Na 2 HP0 4 -7H20, 1.4 mM KH 2 P0 4 ) . The sample was 
electrophoresed through a 12% total (2.67% crosslink) 
SDS-polyacrylamide gel using standard conditions. The 

30 40 kd protein band was visualized by soaking the gel in 
0.1 M KCl at 4°C for 15 minutes which precipitates the 
proteins in situ. The 4 0 kD ar-Gal band was excised 
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from the gel, transferred to a 1.5 ml microcentrifuge 
tube, and homogenized in 200 fil of PBS . The final 
mixture, 300 pi, contained approximately 12 0 pg of 
af-Gal . 

5 Two Balb-C mice were each immunized 

intraperitoneally (IP) with 60 /xg of the a-Gal 
preparation mixed 1:1 with complete Freund's adjuvant. 
Mice were boosted IP on day 14 and 28 with 
approximately 30 /xg of isolated 40 kD band mixed 1:1 

10 with incomplete Freund's adjuvant. Test bleeds (tail) 
were taken on day 21 and 35 to evaluate the titer of 
anti-a-Gal antibodies. Hyperimmune mouse ascites fluid 
(HMAF) was produced by injecting IP approximately 2 x 
10 8 sarcoma 180 cells per mouse 15 days after the second 

15 boost. Ascites fluid was collected on days 8 and 12 

post induct ion. Test bleeds and HMAF were evaluated by 
Western blots of BMB coffee bean a-Gal transferred to 
Nitroplus 2000 membrane (Micron Separations, Inc.). 

Example 3 

20 Analysis of or-Galactosida se Levels in 

Developing Coffee Beans 
Fresh coffee fruits were separated into eleven 
developmental stages, 1 to 11. Stages 1-5 contain no, 
1/4 full, 1/2 full, 3/4 full and full endosperm, 

25 respectively, as graded by inspection of seed endosperm 
after taking a center cross-sectional cut through the 
green cherries. Stages 6-10 are based on visible 
ripening of the fruit wherein the epicarp (skin) 
progresses from green-yellow (stage 6) to yellow-green 

3 0 (stage 7) to yellow (stage 8) to yellow-red (stage 9) 

to red (stage 10), and stage 11 represents fully mature 
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cherries dried in the field, roughly equivalent to the 
harvested and dried fruit used for roasting. 

The pulp (mesocarp) and parchment (endocarp) were 
removed from four beans (two fruits) per developmental 
5 stage, and the beans were sliced into 8 to 10 cross 
sections and incubated in PBS for 2 hours on ice . The 
extracts were then cleared by centrifugation at 5,000 x 
1 g at 4°C for 5 minutes. Samples representing 1/2 00th 
of each extract were prepared for SDS-PAGE and Western 

10 blot analysis by mixing with protein sample buffer and 
boiling for 3 minutes. 

Western blots were blocked with Tris-buf f ered 
saline/tween 20/milk (TBST-milk: 150 mM NaCl, 10 mM 
Tris-Cl, pH 8.0, 0.05% tween 20, 1% nonfat powdered 

15 milk) for 1 hour, incubated with a 1:800 dilution of 

the anti-a-Gal HMAF described in Example 2 in TBST-milk 
for 1 hour, washed with TBST, incubated with a 1:7500 
dilution of goat ant i -mouse alkaline phosphate- 
conjugated antibody (Promega) in TBST-milk for 45 

20 minutes, washed with TBST, and reacted with substrate 

(BCIP and NBT, Promega) according to the manufacturer's 
recommendations. 

Figure 3 shows the results of this Western blot 
analysis. Lane 1 contains molecular weight markers; 

25 lane 2 is blank; lanes 3-10 represent developmental 
stages 4-11; lane 11 is blank; and lane 12 is 
commercial coffee bean a-galactosidase . No 
a-galactosidase was detected in stages 1-4; the enzyme 
was first detected in stages 5-6; the peak activity 

30 appeared at stages 6 and 7. 
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Example 4 

Isolation of or-Galactosidase-Encodina RNA 
RNA was extracted from stages 5 and 6 coffee beans 
using a modification of the method of Georgini, J.F., 
5 Brazilian J Med Biol Res (1988) 21:811-824, 

For each isolation two hundred coffee beans (100 
fruits, approximately 50 g) were peeled and immediately 
frozen in liquid nitrogen, then partially thawed at 
room temperature and 120 ml of room temperature TES 

10 extraction buffer (200 mM Tris-Cl, pH 7.5, 10 mM EDTA, 
1% SDS) was added. When fully thawed (temperature kept 
below 4°C), the beans were ground in a Sorvall 
omnimixer for 1.5 minutes on setting 6, and then 
homogenized with a Polytron homogenizer using six 15 

15 second pulses at setting 6. The homogenate was 

filtered through miracloth (Calbiochem) and divided 
into six aliquots. Each aliquot was extracted with a 
half volume of TES-buffered phenol, and the phases were 
separated by centrif ugation at 25,000 x g for 10 

20 minutes. . The aqueous phase from each aliquot was mixed 
with an equal volume of CsCl solution (100% w/v, 
density approximately 1.7 g/ml) , and the RNA was then 
pelleted through a cushion of CsCl . 

The cushions were set up as follows: 10 ml of 100% 

25 (w/v) CsCl solution was added to a polyallomer 
centrifuge tube, and 28.5 ml of the coffee bean 
extract-CsCl solution was carefully overlaid. The 
gradients were centrifuged in a Beckman SW28 rotor at 
25,000 rpm for 18 hours at 20°C. The resulting RNA 

30 pellets were dissolved in 0.3 M NaOAc and precipitated 
with two volumes of ethanol. The RNA was collected by 
centrif ugation, dissolved in 100 fil of water, and 
quantitated by UV spectrophotometry. 
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Poly (A) + RNA was prepared using Promega's 
PolyATract magnetic sphere kit. Approximately 1 mg of 
total RNA was used in a small scale isolation protocol 
provided by the manufacturer. The resulting poly (A) + 
5 RNA was quant it at ed by UV spectrophotometry. Typical 
yields for each 200-bean preparation were about 1.5 mg 
of total RNA containing about 0.5-0.6% polyadenylated 
RNA. The quality of total and poly (A) + RNA was 
evaluated by standard MOPS /EDTA- formaldehyde gel 

10 electrophoresis using 0.66 M formaldehyde. 

Poly (A) + RNA was also evaluated by reverse 
transcription assays performed in a 60 fil volume 
containing 1 fig of poly (A) + RNA, 12 fil 5X buffer 
(Bethesda Research Laboratories, BRL) , 15 units RNasin 

15 (Pharmacia), 500 (M each dNTPs, 25 fiC± a^P-dCTP, 100 

pmole oligo-d(T) primer, and 4 00 units of M-MLV reverse 
transcriptase (BRL) . The reaction was incubated at 
37°C for 10 min with all components minus enzyme, then 
for 3 0 min at 37° C with enzyme, and stopped by cooling 

20 on ice. Control reactions contained 1.5 fig of purified 
globin mRNA (BRL) . The percent incorporation was 
determined by counting total and bound counts on DEAE 
membrane (Schleicher and Schuell, NA45) . The isolated 
coffee bean mRNA gave 32 P- incorporation of about 0.6% of 

25 label versus about 2.5% in control reactions with 
purified globin mRNA. 

Example 5 
Construction and Screening 
of a cDNA Expression Library 
30 Approximately 3.5 /ig of poly (A + ) RNA prepared from 

stages 5 and 6 was used to construct an expression 
library lambda in vector UniZAPII (Stratagene) , using 
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the ZAP-cDNA synthesis Kit (Stratagene) . The kit 
synthesizes 1st strand cDNA using an oligo-d(T) 
primer/adaptor (which contains an unmethylated Xhol 
site), M-MLV reverse transcriptase, and 5 -methyl - dCTP . 
5 Synthesis of 2nd strand cDNA is based on the RNase H 
procedure described by Gubler and Hoffman, Gene (1985) 
25:263. Following 2nd strand synthesis, EcdRI adaptors 
were added, and the cDNA was digested with Xhol . 
Methylated Xhol sites are resistant to Xhol cleavage. 

10 Incorporation of the £7coRI and Xhol sites enables 
directional cloning. The synthesized cDNA was size 
fractionated on a sephacryl S-400 (Pharmacia) spin 
column and ligated to vector DNA. The ligated DNAs 
were packaged with Gigapack II Gold lambda phage 

15 extracts (Stratagene) and, subsequently titered using 
E. coll strain PLK-F' (Stratagene) which is capable of 
accepting methylated cDNA. The primary library was 
amplified in E. coll PLK-F' by plating approximately 1 
x 10 5 pfu on each of ten 150 mm plates and eluting the 

20 resulting phage from the plates. 

Eleven phage were randomly picked from the library 
and determined to contain inserts using the polymerase 
chain reaction (PCR) with primers (pUC forward and 
reverse) that flank the multiple cloning sites in the 

25 vector. The average size of the inserts was 

approximately 700 bp. Amplification of the primary 
library resulted in a titer of 1 x 10 10 pfu/ml. 

A total of 6 x 10 5 pfu from the amplified library 
were plated with the E. coll strain XL-l-blue 

30 (Stratagene) on twelve 150 mm plates and screened with 
the anti-cv-galactosidase antibody of Example 2. The 
plates were incubated at 42°C for 4 hours, expression 
of cDNA inserts was induced by placing membranes 



WO 95/06478 



PCT/US94/09739 



- 25 - 

(Nitroplus 2000, 0.45 pm, Micron Separations, Inc . ) 
previously soaked in 10 mM IPTG on the plates. After 
incubating at 37°C for 3.5 hours, the first filter was 
removed, and a second IPTG-soaked filter was placed on 
5 the plate, incubated at 37°C for 5.5 hours, and then 
removed. Filters were rinsed in TBST for approximately 
1 minute, air dried, and stored in 4°C overnight. 

The filters were immunoprobed using a 1:500 
dilution of the anti-a-Gal antiserum that was 
10 preadsorbed with an E. coli extract (Stratagene) as 
recommended by the manufacturer. All steps of the 
immunoscreening procedure were carried out at room 
temperature using approximately 10 ml of solution per 
filter (100 ml total) . The procedures for 
15 immunoscreening the library were the same as described 
above for Western blots. 

Immunoscreening of 6 x 10 5 pfu from the amplified 
library with the anti-a-Gal HMAF identified eleven 
plaques that were positive on duplicate filters. 
20 Following one round of plaque purification, five of the 
eleven were positive and were plaque purified a second 
time. The insert sizes of the five isolated phage were 
determined by PCR amplification using pUC forward and 
reverse sequencing primers. Four of the five clones 
25 contain an insert of approximately 1500 bp, and the 

fifth clone contains an insert of approximately 700 bp. 

In. order to rapidly confirm the identity of the 
isolated clones, we used a degenerate oligonucleotide 
(comprising 256 sequences) corresponding to what we 
3 0 identified as the largest conserved amino acid sequence 
(Gly-Gly-Trp-Asn-Asp, GGWND) among the guar, yeast, and 
human a-Gal sequences. The GGWND oligo (primes the 
sense strand) was used in combination with a pUC 
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forward sequencing primer (primes the antisense strand) 
for PCR amplification of the 3' end of cDNA. Three 
clones were used as templates for PCR amplification, 
one 1500 bp insert clone, the 700 bp insert clone, and 
5 a randomly picked, non-immunoreactive clone. Assuming 
that the coffee-a-Gal primary structure is homologous 
to that of guar a-Gal and contains the GGWND sequence, 
we expected a PCR product of approximately 700 bp from 
clones not truncated at the 3 ' end . Both the short and 

10 long immunoreactive clones yielded an amplification 
product of the predicted size, while the randomly 
picked, non-immunoreactive clone did not. These 
results support the interpretation that the 
immunoreactive cDNA clones encode coffee a-Gal. If so, 

15 the 1500 bp inserts are of sufficient size to encode a 
40 kD protein and potentially represent full-length 
coffee a-Gal cDNA clones. The production of a PCR 
product by the 700 bp cDNA insert of a size 
indistinguishable from the PCR product generated by the 

20 1500 bp cDNA suggests that the shorter insert is an 
a-Gal cDNA truncated at its 5' end. 

Following two rounds of plaque purification, 
recombinant phagemids (pBluescriptSK +/-) were excised 
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from the lambda clones by coinf ection with helper phage 
as described by the manufacturer. Bacteria (XLl-Blue) 
containing rescued phagemids were single colony 
purified. Inserts were released from all five 
5 recovered phagemids after EcoRI-XhoI double digestion, 
and as previously determined by PCR analysis, four of 
the five inserts were of the same size, approximately 
1500 bp, and the fifth clone contained a smaller insert 
of approximately 700 bp. The five plasmids are 

10 referred to as p27agAl-l, p27agCl-l, p27orgEl-l, 

p27argHl-l (700 bp insert), and p27argH2-3. Restriction 
enzyme analysis demonstrated that the large inserts 
contain a unique Hindlll site approximately 500 bp from 
the 5' end of the insert and a unique PstI site 

15 approximately 3 00 bp from the 3' end. The short clone 
also contains a similarly positioned PstI site, but 
lacks the Hindlll site. Based on the PCR results with 
the GGWND oligo and the restriction enzyme analysis, 
the 700 bp clone appeared to represent a 5' truncated 

20 version of. the coffee ar-Gal cDNA, possibly formed by 
premature termination during the first strand cDNA 
synthesis . 

Example 6 

Nucleotide Sequence Determination and Analysis 
25 Nucleotide sequence of one of the 1500 bp inserts, 

clone p27agCl-l, was determined. We obtained double 
stranded sequence of this clone with the exception of 
180 protein-encoding nucleotides (540-720) and the 
untranslated leader, where sequence was obtained from a 
30 single strand. The other three clones containing 1500 
bp inserts were partially sequenced, and we observed no 
differences except for the position of the 5' end of 



WO 95/06478 



PCT/US94/09739 



- 28 - 

the cDNAs. To facilitate the sequencing of the clone 
p27aGCl-l, the unique Hindlll and PstI site were used 
to generate subclones. We used a combination of pUC 
forward and reverse primers and insert-specific primers 
5 to sequence p27agCl-l and the subclones. The 
sequencing strategy is shown in Figure 4 . 

The complete sequence and deduced amino acid 
sequence is shown in Figure 1. The 6 cysteines 
potentially involved in disulfide formation are 
10 underlined. The potential polyadenylation signal (AAT 
AAA) is underlined. The potential glycosylation site 
is overscored by +++. 
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Claims 

1 . A composition of DNA molecules consisting of 
DNA molecules encoding coffee bean a-galactosidase. 

2 . A DNA encoding coffee bean a-galactosidase 
5 wherein the sequence encoding said a-galactosidase is 

free of introns. 

3 . The DNA of claim 1 wherein said encoding DNA 
encodes a-galactosidase of the amino acid sequence 
shown as amino acids 1-363 in Figure 1 or a fragment 

10 thereof or an amino acid sequence encoded by an allelic 
variant of the nucleic acid sequence of Figure 1. 

4. A recombinant expression system capable , when 
contained in a host cell, of expressing a DNA encoding 
coffee bean a-galactosidase , which expression system 
15 comprises a DNA encoding a-galactosidase having the 
amino acid, sequence numbered 1-363 in Figure 1, * or an 
enzymatically active fragment thereof, or an allelic 
variant thereof operably linked to control sequences 
compatible with said host cell. 

20 ■ 5 . A host cell modified to contain the 

expression system of claim 4. 

6. A method to produce coffee bean a- 
galactosidase, which method comprises culturing the 
cells of claim 5 under conditions wherein said encoding 
25 DNA is expressed to produce said a-galactosidase; and 

recovering said a-galactosidase from the culture. 
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7. Coffee bean a-galactosidase recombinantly 
produced by the method of claim 6. 

8, Coffee bean a-galactosidase in purified and 
isolated form having the amino acid sequence encoded by 

5 the DNA corresponding to amino acid positions 1-363 of 
Figure 1 or an allelic variant thereof. 

9- . A method to convert human red blood cells 
containing B antigen to red blood cells wherein said B 
antigen has the transfusion characteristics of Type O, 
10 which method comprises contacting said red blood cells 
with the a-galactosidase of claim 8 under conditions 
wherein the terminal galactose residue of said B 
antigen is removed. 

10 . Antibodies specifically immunoreactive with 
15 the coffee bean a-galactosidase of Claim 8 . 

11. A composition of antibodies consisting of 
antibodies specifically immunoreactive with the coffee 
bean a-galactosidase of Claim 8. 

12. The composition of Claim 11 wherein said 
20 antibodies are monoclonal antibodies. 

13. An oligonucleotide which is the complement of 
the nucleic acid of Figure 1 or of a portion thereof. 

14. An oligonucleotide capable of forming a 
triplex with a double stranded DNA comprising the 

25 nucleic acid sequence of Figure 1 and its complement. 
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15 . An oligonucleotide which is the complement of 
at least a portion of an mRNA encoding coffee bean 
ar-galactosidase or of the sense strand of the DNA 
region which affects production of said o?-Gal. 

5 16 . An oligonucleotide capable of forming a 

triplex with a double stranded DNA comprising the DNA 
region which effects expression of the protein -encoding 
region of the DNA of Figure 1. 
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